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Abstract 

In  the  reliability-based  design  optimization  (RBDO)  process,  surrogate  models  are  frequently  used  to  reduce  the 
number  of  simulations  because  analysis  of  a  simulation  model  takes  a  great  deal  of  computational  time.  On  the 
other  hand,  to  obtain  accurate  surrogate  models,  we  have  to  limit  the  dimension  of  the  RBDO  problem  and  thus 
mitigate  the  curse  of  dimensionality.  Therefore,  it  is  desirable  to  develop  an  efficient  and  effective  variable 
screening  method  for  reduction  of  the  dimension  of  the  RBDO  problem.  In  this  paper,  requirements  of  the  variable 
screening  method  for  deterministic  design  optimization  (DDO)  and  RBDO  are  compared,  and  it  is  found  that  output 
variance  is  critical  for  identifying  important  variables  in  the  RBDO  process.  An  efficient  approximation  method 
based  on  the  univariate  dimension  reduction  method  (DRM)  is  proposed  to  calculate  output  variance  efficiently.  For 
variable  screening,  the  variables  that  induce  larger  output  variances  are  selected  as  important  variables.  To 
determine  important  variables,  hypothesis  testing  is  used  in  this  paper  so  that  possible  errors  are  contained  in  a  user- 
specified  error  level.  Also,  an  appropriate  number  of  samples  is  proposed  for  calculating  the  output  variance. 
Moreover,  a  quadratic  interpolation  method  is  studied  in  detail  to  calculate  output  variance  efficiently.  Using 
numerical  examples,  performance  of  the  proposed  method  is  verified.  It  is  shown  that  the  proposed  method  finds 
important  variables  efficiently  and  effectively. 

Keywords:  Variable  Screening,  RBDO,  Surrogate  Model,  Output  Variance,  1-D  Surrogate  Model,  Partial  Output 
Variance,  Hypothesis  Testing,  Univariate  Dimension  Reduction  Method 


1.  Introduction 

The  variable  screening  method  is  a  useful  method  in  the  design  optimization  process  because  it  can  select 
essential  design  variables  for  accurate  surrogate  models  and  effective  design  optimization.  In  the  formulation  of  a 
design  optimization  problem,  a  set  of  design  variables  that  describe  the  system  need  to  be  identified  [1].  Design 
variables  are  selected  to  be  independent  of  each  other  as  much  as  possible  in  the  design  space.  The  number  of 
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independent  design  variables  is  known  as  the  degrees  of  freedom,  and  this  is  the  dimensionality  of  the  optimization 
problem.  To  obtain  an  appropriate  optimum  design,  a  minimum  number  of  design  variables  is  required.  For  this 
reason,  it  is  better  to  identify  as  many  design  variables  as  possible  and  then  fix  some  of  the  variables  at  certain 
values  according  to  the  variable  screening  result.  The  variable  screening  method  can  play  a  key  role,  especially  in 
reliability-based  design  optimization  (RBDO),  because  the  RBDO  process  requires  a  larger  number  of  analyses  than 
the  deterministic  design  optimization  (DDO)  process  due  to  reliability  analyses  and  the  design  sensitivities  of 
probabilities  of  failure.  To  this  end,  surrogate  models  are  usually  used  to  reduce  the  number  of  analyses  required  in 
RBDO.  Various  surrogate  model  methods  such  as  the  radial  basis  function  (RBF),  polynomial  response  surface 
(PRS),  support  vector  regression  (SVR),  Kriging,  and  dynamic  Kriging  (DKG)  methods  have  been  developed  [2-10]. 
However,  even  for  the  surrogate  model,  the  number  of  design  variables  becomes  a  critical  factor  because  surrogate 
model  generation  is  difficult  for  high-dimensional  problems,  due  to  the  curse  of  dimensionality. 

Variable  screening  methods  have  been  developed  in  various  disciplines.  In  statistics,  important  variables  were 
found  to  create  an  accurate  surrogate  model  of  computer  simulation  using  the  maximum  likelihood  estimator  (MLE) 
of  correlation  parameters  of  the  Gaussian  process  for  a  deterministic  problem  [11].  Using  a  regression  model, 
essential  variables  among  candidate  variables  were  efficiently  identified  based  on  data  [12,  13].  Especially  in 
statistical  learning  theory,  various  feature  selection  methods  have  been  developed  to  choose  a  reduced  number  of 
input  variables  to  represent  an  output  effectively  [14].  In  addition,  methods  such  as  manifold  learning  have  been 
used  to  preserve  input  information  in  reduced  dimension  for  efficient  statistical  analysis  [15].  In  physics,  a  variable 
screening  model  was  developed  for  the  quasi-molecular  treatment  of  ion-atom  collision  [16].  In  engineering,  a 
confidence  interval  of  the  coefficient  of  a  linear  surrogate  model  was  proposed  to  detect  key  variables  for  a  car  crash 
DDO  problem  [17].  A  sampling-based  sensitivity  measure  using  a  small  amount  of  data  was  introduced  to  rank  the 
importance  of  variables  and  was  applied  to  long-term  performance  of  a  geologic  repository  for  high-level 
radioactive  waste  [18].  Moreover,  the  design  sensitivity  method  can  be  extended  to  the  variable  screening  method 
because  vital  variables  have  larger  design  sensitivity.  In  the  deterministic  problem,  the  design  sensitivity,  which 
shows  the  rate  of  change  in  the  performance  measure  at  the  design  point,  can  be  obtained  using  various  methods  [19, 
20]  and  is  called  local  sensitivity  analysis  (LSA)  [21,  22].  For  a  reliability  problem,  the  variability  of  the  input 
random  variable  should  be  incorporated  to  assess  the  design  sensitivity  of  a  probabilistic  constraint.  The  design 
sensitivity  of  the  probabilistic  constraint  using  the  first-order  reliability  method  (FORM)  [23-25],  dimension 
reduction  method  (DRM)  [26,  27],  and  sampling-based  stochastic  sensitivity  [28,  29],  could  be  used  to  identify 
important  design  variables.  In  addition,  global  sensitivity  analysis  (GSA),  such  as  correlation  ratio  [30],  global 
sensitivity  indices  [31],  and  analytical  GSA  methods  [22],  can  be  used  for  variable  screening  as  well  [32]. 

However,  previous  works  may  have  limitations  to  being  directly  applied  to  RBDO  with  surrogate  models.  If  a 
method  depends  entirely  on  existing  data  [12-15],  it  may  not  be  possible  to  carry  out  RBDO  because  design 
variables  change  during  the  optimization  process.  Finding  input  variables  from  among  all  the  variables  that  may  be 
irrelevant  to  an  output  to  well  represent  the  output  from  data  [14]  is  not  an  issue  from  the  RBDO  perspective.  The 
relationship  and  relative  input  variables  are  already  known  through  the  computer  aided  engineering  (CAE)  such  as 
the  finite  element  method  (FEM)  or  computational  fluid  dynamics  (CFD).  A  method  using  CAE  to  find  variables 
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which  significantly  affect  output  reliability  is  more  interesting.  Moreover,  capturing  the  input  information  in 
reduced  variables  [15]  is  not  an  issue  with  RBDO,  either;  how  much  output  uncertainty  is  affected  by  the  input 
variables  is  the  main  interest.  A  method  developed  for  a  specific  problem  [16]  will  be  inadequate  for  broad 
applications.  Variable  screening  and  design  sensitivity  methods  for  a  deterministic  problem  [11,  17,  19,  20]  may  not 
be  applicable  for  RBDO  because  input  randomness  is  not  considered.  Methods  that  require  a  very  large  number  of 
analyses  [30,  31]  could  be  ineffective  for  RBDO  of  computationally  demanding  problems  and  could  become 
unstable  when  sufficient  numbers  of  analyses  are  not  provided  [18].  The  design  sensitivity  of  the  probabilistic 
constraint  using  FORM  or  DRM  [23-27]  requires  searching  for  the  most  probable  point  (MPP),  which  may  be  very 
difficult  to  obtain  for  a  large-dimensional  problem.  If  a  method  was  developed  based  on  the  assumption  that 
accurate  full-dimensional  surrogate  models  are  available  a  priori  [22,  28,  29],  RBDO  could  be  carried  out  using  the 
surrogate  models  because  it  can  provide  accurate  response  and  sensitivity  of  the  problem,  unless  the  optimization 
algorithm  has  a  limitation  on  the  number  of  design  variables,  which  is  not  common.  From  the  previous  works,  key 
desirable  properties  of  a  variable  screening  method  for  RBDO  with  a  surrogate  model  were  found:  it  should  (1)  be 
efficient,  (2)  consider  input  randomness,  (3)  not  require  a  full-dimensional  surrogate  model,  and  (4)  be  applicable  to 
broader  problems. 

Therefore,  the  objective  of  this  paper  is  to  develop  a  variable  screening  method  that  can  satisfy  the  above 
desirable  properties.  The  reliability  analysis  in  RBDO  captures  the  output  variability  induced  by  the  input 
variability  and  the  sensitivity  of  the  performance  function.  The  variable  that  induces  larger  output  variability  is 
important  in  the  RBDO  process.  In  this  paper,  a  partial  output  variance,  which  is  the  output  variance  when  one 
design  variable  has  variability  while  others  are  fixed  at  their  mean,  is  used  to  find  important  design  variables  [33]. 
The  partial  output  variance  is  simple  to  calculate  and  requires  a  1-D  surrogate  model  for  each  design  variable.  The 
method  introduced  in  this  paper  has  strengths  and  weaknesses.  Its  main  strength  is  its  efficiency  and  practical 
applicability.  Its  weakness  is  accuracy;  the  interactions  between  the  design  variables  are  not  fully  captured. 
Flowever,  practical  applicability  is  the  focus  in  this  paper  because  it  is  very  important  for  large-scale  problems.  In 
the  following  sections,  the  proposed  method  will  be  explained  in  detail,  and  its  strengths  and  weaknesses  will  be 
fully  discussed.  To  demonstrate  the  effectiveness  of  the  proposed  method,  analytical  examples  and  a  large-scale 
industrial  problem  are  used. 

2.  Variable  Screening 

As  explained  in  the  introduction,  screening  out  variables  means  finding  important  variables  among  all  random 
variables.  Here,  the  word  “important”  could  have  different  meanings  depending  on  the  problem  we  are  dealing  with. 
In  the  following  two  sections,  the  difference  between  variable  screening  for  DDO  and  RBDO  will  be  explained. 
Based  on  the  difference,  the  required  properties  of  variable  screening  for  RBDO  will  be  introduced. 

2,1  Variable  Screening  for  DDO 

A  DDO  problem  can  be  formulated  as 

minimize  cost(d) 
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subject  to 


(1) 


Gj{  d)<0,  j  =  \,...,NC 

d1  <  d  <  dc ,  deR  w 

where  d,  G„  NC,  and  NDV  are  the  design  variable  vector.  /111  constraint  function,  number  of  constraints,  and  number 
of  design  variables,  respectively. 

As  stated  before,  in  the  DDO  problem,  the  input  design  variables  do  not  have  uncertainty,  and  thus  the  design 
sensitivity  can  be  used  as  a  barometer  to  determine  the  importance  ranking  of  design  variables  with  respect  to  the 
performance  measure.  The  question  is:  “Where  should  the  importance  ranking  of  design  variables  be  determined?” 
or  “Where  should  the  design  sensitivity  be  calculated?” 

The  LSA  calculates  the  design  sensitivity  at  a  given  design  point  [21,  22].  Usually,  LSA  is  used  to  provide  the 
direction  of  design  movement  in  the  optimization  process.  For  variable  screening,  LSA  can  provide  the  importance 
ranking  of  design  variables  at  the  current  design  point.  However,  the  importance  ranking  at  the  given  design  point 
could  be  different  from  the  ranking  at  other  design  points  if  the  performance  measure  is  a  nonlinear  function  of 
design  variables.  On  the  other  hand,  GSA  is  used  to  calculate  overall  design  sensitivity  on  the  entire  design  domain. 
The  GSA  is  like  averaged  design  sensitivity  in  the  design  domain.  As  it  is  an  average,  the  importance  ranking  using 
GSA  could  mislead  at  specific  points  or  even  regions.  Hence,  LSA  and  GSA  have  advantages  and  disadvantages  for 
variable  screening  [21]. 

2.2  Variable  Screening  for  RBDO 
A  general  RBDO  problem  can  be  formulated  as 
minimize  cost  (d) 

PF  =  P[Gj (X)  >  0]  <  PTrar  j  =  \,...,NC 

subject  to  '  '  (2) 

d1  <  d  <  dw,  de  Rw,  and  X  e  Rw 

where  d,  G„  Pj‘" ,  NC,  NDV,  and  NR V are  the  design  variable  vector,/11  constraint  function,/11  target  probability  of 

failure,  number  of  constraints,  number  of  design  variables,  and  number  of  random  variables,  respectively. 

In  the  RBDO  process,  design  variable  vector  d  is  the  mean  vector  of  the  corresponding  random  variable  X. 
Though  the  design  variable  d  is  deterministic,  the  design  sensitivity  for  RBDO  should  consider  the  randomness  of  X 
because  the  constraints  are  based  on  the  probabilistic  performance  measure  P[G;(X)>0]  as  shown  in  Eq.  (2). 
Therefore,  the  design  sensitivity  of  the  performance  measure  alone  cannot  be  used  as  a  barometer.  The  design 
sensitivity  of  the  probabilistic  performance  measure  can  be  obtained  by  using  several  methods,  such  as  FORM  [23- 
25],  DRM  [26,  27],  and  sampling-based  stochastic  sensitivity  [28,  29].  The  design  sensitivity  of  the  probabilistic 
performance  measure  can  be  used  for  variable  screening.  The  design  sensitivities  by  those  methods  are  LSA 
because  they  provide  different  sensitivities  at  different  designs.  The  GSA  method  is  also  applicable  for  variable 
screening  in  RBDO  problems  as  it  is  in  DDO  problems.  Again,  both  LSA  and  GSA  methods  have  advantages  and 
disadvantages. 

The  random  parameters  will  not  increase  the  dimensionality  of  the  optimization  problem  because  they  are  not 
random  design  variables.  However,  the  surrogate  model  that  includes  random  parameters  is  still  required  because 
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they  affect  the  output  distribution.  The  main  objective  of  this  paper  is  to  select  important  design  variables  so  that 
accurate  surrogate  models  can  be  generated  and,  at  the  same  time,  an  appropriate  optimum  design  (i.e.,  not 
suboptimum)  can  be  obtained  in  the  RBDO  process.  Hence,  once  variable  screening  is  done,  the  screened-out 
random  design  variables  need  to  be  fixed,  not  to  be  a  random  parameter.  However,  fixing  a  random  variable  as  a 
deterministic  variable  will  reduce  the  total  output  variability. 

Consider  a  simple  example: 

X,~N( 5,32),  i  =  1,2, ...  ,10 


10  / 

Y  =  YJXi  ~N  50,(3V10) 


If  the  probabilistic  performance  measure  is  P[Y>  60],  then  the  reliability  analysis  result  is 


P[T>60]  =  1-® 


60-50 

3Vi() 


=  0.1459 


(3) 


(4) 


However,  if  one  dimension  is  reduced  by  screening  out  Xl0  =  //Kl  =  5  while  the  other  variables  remain  random,  then 
the  probabilistic  performance  measure  changes  to 

9 


7  =  +5~N(50,92) 


(5) 


As  a  consequence,  the  reliability  analysis  result  yields 


prf>60l  =  l-0  60  50  =0.1333 
L  J  Q 


(6) 


From  Eqs.  (4)  and  (6),  0.0126  (1.26%)  of  the  output  variability  is  decreased  by  screening  out  one  variable.  A  more 
fundamental  problem  is  that  the  lost  amount  1.26%  cannot  be  estimated  without  the  full-dimensional  reliability 
analysis  result  of  Eq.  (4).  On  the  other  hand,  let’s  assume  that  Afi0  has  a  smaller  variance  of  one.  Then,  the  full¬ 
dimensional  reliability  analysis  yields 


>  60)  =  1-0 


60-50 
v  M 


=  0.1347 


(7) 


From  Eqs.  (6)  and  (7),  the  difference  is  0.0014  (0.14%),  which  could  be  acceptable.  Therefore,  in  this  case,  Xl0 
could  be  fixed  at  the  mean  value.  As  shown  in  the  example,  the  output  variability  decreases  if  any  random  variable 
is  fixed  at  a  deterministic  value.  However,  there  are  some  variables  that  affect  the  output  variability  a  small  amount. 
The  variable  screening  method  for  effective  surrogate  models  for  RBDO  is  to  find  those  variables  that  have  small 
effects  on  the  output  variability.  It  is  noted  that  the  random  parameters  are  considered  as  much  as  the  random  design 
variables  in  this  paper.  Even  though  the  random  parameters  are  not  changing  during  the  RBDO  process,  they  will 
influence  the  output  variability.  Hence  they  should  be  considered  in  the  variable  screening  process,  so  that 
reliability  of  the  performance  measure  can  be  accurately  approximated  using  reduced  dimension. 


3.  Variable  Screening  with  1-D  Surrogate  Model 

The  probability  of  failure  cannot  be  solely  determined  by  the  output  variability.  To  obtain  accurate  probability 
of  failure,  the  output  distribution  is  needed,  that  is,  all  statistical  information  of  the  output  is  required.  However, 
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even  though  an  input  distribution  is  known,  it  is  very  difficult  to  obtain  complete  output  distribution  since  the 
performance  measure  could  be  implicit,  a  non-linear  function,  or  even  both.  For  example,  for  a  given  normal  input 
distribution,  the  output  distribution  could  be  bimodal  as  well  as  asymmetric.  Consequently,  it  is  impractical  to  select 
a  reduced  number  of  input  variables  based  solely  on  probability  of  failure.  As  discussed  in  previous  sections,  a 
screened-out  variable  will  be  fixed  at  its  mean  value.  Then  the  change  of  output  mean  will  be  minimized.  As  a 
result,  the  output  variability  becomes  the  measure  that  can  determine  a  probability  of  failure.  Of  course,  other 
statistical  moments  or  parameters,  such  as  skewness  and  kurtosis,  could  affect  probability  of  failure.  However, 
either  of  these  statistical  moments  cannot  be  a  measure  by  itself.  For  example,  a  variable  that  induces  larger  (or 
smaller)  output  skewness  may  not  be  an  important  variable,  but  it  could  be  an  important  variable  when  it  induces 
larger  (or  smaller)  output  skewness  and  very  similar  output  variability.  We  could  consider  a  combination  of  the 
moments  as  a  measure,  but  there  are  too  many  possible  combinations  to  consider.  Hence,  under  the  assumption  that 
the  output  mean  is  similar,  the  output  variability  is  chosen  as  the  measure  to  select  vital  variables  for  RBDO  in  this 
paper. 

The  output  variability  can  be  quantified  by  the  output  variance  as  shown  in  the  previous  section.  The  exact 
output  variance  of  a  nonlinear  implicit  performance  measure  is  very  difficult  to  obtain.  Hence,  an  approximated 
output  variance  is  used  in  this  paper.  In  the  following  sections,  the  output  variance  is  decomposed  into  partial  output 
variances,  which  are  the  output  variances  when  each  input  variable  is  random  and  the  others  are  fixed  at  their  mean 
values.  Then,  a  method  to  find  the  design  variables  that  have  a  large  impact  on  output  variance  is  developed  using  a 
hypothesis  testing. 


3.1  Approximated  Output  Variance 

A  univariate  DRM  is  a  well-known  approximation  method  for  statistical  moments  using  multiple  1-D 
integrations  [34].  Consider  a  performance  measure  Y  and  its  realization  y  subject  to  random  input  vector 

X={A1,...Wv}T: 

Y{X)  =  Y{Xx,...,Xn),  y{*)  =  y{xl,...,xN)  (8) 

Define  a  function  7„  which  is  the  performance  measure  when  Xt  is  random  and  other  variables  are  fixed  at  their 
mean  values,  as 

Yi=Y(jUl,...,jUi_l,Xi,jUi+l,...,JuN)  (9) 


The  realization  of  the  performance  measure  at  the  input  mean  point  is  defined  as 

To  =  XH'x) 

The  Ith  statistical  moment  of  Y,  which  is  approximated  using  the  univariate  DRM,  is  defined  as  [34] 


m,  =  E 


(10) 


(11) 


Then,  the  output  variance  <j\  can  be  approximated  as 

CTy  =  m2  -m[  =  +  2^  pyy  aY  Gy 

i  i> ] 


(12) 
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where  a\  is  the  variance  of  Eq.  (9),  which  is  the  partial  output  variance  when  only  X,  is  random,  and  pYY  is  the 
correlation  coefficient  between  Y:  and  Yr  As  shown  in  Eq.  (12),  the  partial  output  variances  <JY  are  the  main 
variables  for  approximating  the  output  variance  <JY  .  When  aY  is  larger  than  other  partial  output  variances,  it  takes 
the  largest  portion  in  the  output  variance  crY .  Therefore,  if  some  X,  produces  larger  partial  output  variance  than 
others,  then  Xt  should  be  selected  as  an  important  variable.  It  is  noted  that  calculation  of  aY  requires  only  1-D 

integration,  and  thus  only  1-D  surrogate  models  are  required. 

Statistical  correlation  between  X,  and  Xj  yields  the  term  of  pYYaYaY  in  Eq.  (12)  and  affects  the  output  variance. 

When  A,  and  Xj  are  strongly  correlated,  one  could  be  replaced  by  the  other.  To  calculate  the  term  pYY  aYaY  ,  a  two- 

dimensional  surrogate  model  is  required.  If  there  are  only  a  few  correlation  pairs,  calculating  the  correlation  term 
could  be  affordable.  Elowever,  with  a  practical  point  of  view,  the  partial  output  variance  aY  is  the  focus  in  this 

paper.  As  we  are  looking  for  important  variables,  not  the  value  of  the  output  variance  of  aY ,  the  partial  output 
variance  would  be  enough  for  variable  screening.  In  Fig.  1,  contours  of  independent,  positively  correlated  (p  =  0.8) 
and  negatively  correlated  (p  =  -0.8)  probability  density  functions  are  shown.  Correlation  determines  how  the 
random  variables  are  distributed  inside  the  box  (dotted  line),  whereas  the  size  of  the  box  is  determined  by  variances 
of  X,  and  X2.  It  can  be  seen  that  the  primary  effect  on  output  variance  is  the  box  size,  and  then  distribution  inside 
the  box  follows.  Consequently,  to  perform  variable  screening  efficiently,  the  first  thing  we  need  to  consider  is  the 
box  size,  not  the  distribution  of  random  variables  inside  the  box.  Elence,  the  correlation  term  is  not  considered  in 
this  paper  for  efficiency  and  practicality.  It  is  noted  that  the  statistical  correlation  between  X,  and  Xj  will  be 
considered  in  reduced-dimensional  RBDO  if  both  variables  are  selected. 


Figure  1.  Effect  of  Variance  and  Correlation  of  Input  Random  Variables 
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The  partial  output  variance  of  aj  is  like  LSA  because  it  can  have  different  values  at  different  input  mean  points 

jix ,  which  is  the  current  design  point  in  the  RBDO  process.  Hence,  the  variable  screening  result  could  be  changed 
as  the  design  point  changes.  There  are  several  recommended  points  at  which  to  perform  variable  screening  using 
LSA.  The  first  one  is  the  DDO  optimum.  As  the  DDO  optimum  is  usually  close  to  the  RBDO  optimum,  the 
variable  screening  result  at  the  DDO  optimum  is  likely  to  be  similar  to  the  result  at  the  RBDO  optimum.  Also,  the 
design  point  where  most  of  the  deterministic  constraints  are  active  can  be  a  good  candidate  point.  It  is  noted  that 
DDO  or  the  design  point  where  the  constraints  are  active  could  be  obtained  using  the  finite  difference  method  in  a 
practical  engineering  problem.  Also,  DDO  could  be  achieved  using  the  sensitivity  obtained  from  a  1-D  surrogate 
model  because  DDO  requires  only  the  deterministic  LSA,  which  is  1-D. 


3.2  Variable  Screening  Using  Hypothesis  Testing 

Using  the  1-D  surrogate  model,  the  partial  output  variance  a\  can  be  calculated  approximately  as 


si  =- 


1 


ni1-! 


ns  2 


(13) 


where  x\j)  is  the  fh  realization  of  the  input  random  variable  Xh  ns  is  the  number  of  samples,  and  y.  is  the  mean  of  yt 


•th 


as, 

j  ns 

y,  (14) 

ns  % i 

As  explained  in  previous  sections,  the  partial  output  variance  s \  can  be  used  to  determine  important  design 

variables.  To  make  the  variable  screening  procedure  systematic,  hypothesis  testing  is  applied  in  this  paper. 
Hypothesis  testing  can  prevent  undesirable  choices  that  could  occur  during  the  decision-making  procedure. 
Calculated  partial  output  variance  si  depends  on  the  number  of  samples  ns.  When  ns  is  large  enough,  the  variable 

screening  result  will  be  accurate.  However,  it  would  require  a  large  computational  time.  Also,  it  is  hard  to 
determine  what  value  of  ns  is  large  enough.  When  ns  is  small,  it  will  include  statistical  error.  If  calculated  si  are 

distinctive  from  each  other  or  with  respect  to  the  screening  threshold  value,  then  the  effect  of  ns  may  not  be 
significant.  However,  ns  could  cause  an  error  when  some  si  are  similar  to  each  other  or  are  near  the  screening 

threshold  value.  Hypothesis  testing  can  prevent  this  problem  in  a  statistical  manner  by  letting  users  control  the  error 
level. 

Various  hypothesis  testing  methods  have  been  developed  for  the  decision-making  problem  [35].  Among  those 
methods,  we  need  the  one  that  is  not  sensitive  to  distribution  type  because  the  distribution  type  of  Yt  or  sj  is  not 

known  in  general.  The  one-sample  /-test  is  developed  based  on  the  central  limit  theorem,  which  states  that  the 
sample  mean  of  non-normal  distribution  follows  normal  distribution  approximately  for  a  large  number  of  samples. 


The  one-sample  /-test  is  not  sensitive  to  underlying  distribution  types,  so  it  is  used  in  this  paper.  As  the  /-test  is  a 
method  for  sample  mean,  ,sy  is  calculated  nr  times  for  its  statistical  moments  as 


=-IX 


2  (*) 


2 

=- 


T2> 
1  k= i 


f>-v])2 


(15) 

(16) 


where  Sy(k)  is  the  kth  repetition  of  s2  and  nr  is  the  number  of  repetitions.  Now,  the  hypothesis  is  constructed: 


H0  :  v;  <  ju0  versus  //,  :  vt  >  fin  (17) 

where  juq  is  the  criterion  of  hypothesis  testing.  According  to  Eq.  (17),  the  design  variable  that  corresponds  to  vj , 

which  is  greater  than  juq  (Hi  is  true),  will  be  selected  as  an  important  variable.  Using  the  one-sample  /-test,  the 
hypothesis  can  be  tested  by  checking  the  following  statement: 

Reject  H0  in  favor  of  H\  if  q>tnr_  {l_a  (18) 

where  a  is  the  significance  level,  tmA  ,  is  /,),_,  (7) ,  and  the  test  statistics  q  is  defined  as 


<7  =  (v,  ~Mo  ) 


(19) 


In  Eqs.  (15)  and  (16),  the  uncertainty  induced  by  ns  is  transferred  to  nr.  Elence,  ns  can  be  a  fixed  number, 
whereas  nr  should  be  decided  appropriately.  Also,  //0  needs  to  be  identified  in  Eq.  (17).  juq  is  the  key  criterion  that 
decides  important  variables,  and  it  should  be  a  value  relative  to  vj  because  the  relative  difference  of  partial  output 
variances  should  be  checked  for  variable  screening.  At  the  same  time,  /Jq  needs  to  be  statistically  independent  from 
E  for  reasonable  hypothesis  testing.  In  this  paper,  preliminary  testing  is  proposed  to  obtain  reasonable  nr  and  //{|  as 
follows.  First,  choose  nr0  which  is  large  enough  so  that  the  central  limit  theorem  holds.  Then,  calculate  the  initial 
statistical  moments  of  si  as 


m- 


'^sr' 


2{k) 


0  k=\ 


s2m  =  - 


1 


- 1 


Using  the  value  from  Eq.  (20),  the  testing  criterion  ju0  relative  to  v.  can  be  calculated  as 

V  N 


(20) 

(21) 

(22) 


where  y  is  a  constant  that  the  user  selects,  nr  is  calculated  by  limiting  type  II  error  (H0  is  accepted  when  H j  is  true) 
at  the  level  of  false  negative  rate  p  as  [35] 


nr  =  max 


E,  (f  -  I .  I  —a  ) 


v/0)  “  Mo 


-,nr0 


(23) 
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In  Eq.  (23),  tnr_h,  should  be  used  instead  of  t  h,  for  accurate  calculation  of  nr.  However,  Eq.  (23)  requires  the 
value  of  nr  on  the  right  side  to  calculate  nr.  To  avoid  this  problem,  tnr  .  is  used  instead,  and  t  produces  a 
conservative  result  as  it  is  larger  than  tnr_x .  because  nr  is  larger  than  nr0  in  Eq.  (23)  and  a  is  small.  Finally,  nr  and 
/Jq  are  determined  so  that  the  proposed  hypothesis  testing  can  be  utilized. 

3.3  1-D  Surrogate  Model 

In  previous  sections,  the  1-D  surrogate  model  is  treated  as  the  given  one  because  it  is  not  difficult  to  generate. 
However,  efficiently  creating  a  1-D  surrogate  model  could  be  an  issue.  For  efficiency,  quadratic  interpolation  is 
proposed  as  a  basic  1-D  surrogate  model  in  this  paper.  Quadratic  interpolation  may  not  be  an  adequate  method  for 
creating  a  surrogate  model  for  a  highly  nonlinear  performance  measure.  However,  a  nonlinear  performance  measure 
can  be  effectively  approximated  by  a  quadratic  function  on  a  small  region.  If  X  follows  normal  distribution,  the 
domain  oi X  is  (-oo,  oo),  whereas  99.73%  of  X  is  in  (jux~3ax,  jux+3ax),  which  is  much  smaller  than  the  infinite 
domain.  Even  if  X  does  not  follow  normal  distribution,  the  region  ( jUx  —3  ay.  jux+3  ay)  can  cover  almost  all 
(approximately  98%)  ofX  In  view  of  the  fact  that  this  paper  is  focused  on  calculation  of  partial  output  variance,  the 
region  (jux~3ax,  jux+3  ax)  is  large  enough.  Hence,  the  1-D  surrogate  model  needs  to  be  accurate  in  the  region  (jux 
-3  ay,  jux+3ax)  so  that  quadratic  interpolation  could  be  an  appropriate  method  to  approximate  the  performance 
measure  in  the  region. 

Quadratic  interpolation  requires  three  samples,  and  the  location  of  samples  affects  the  accuracy  of  interpolation. 
The  location  of  samples  determined  using  the  Chebyshev  polynomial  is  known  to  give  uniform  error  in  the  domain 
[36].  Because  only  the  region  (jUx~ 3<jx,  /ux+3 ay)  is  of  interest,  the  location  of  samples  is  determined  as  xt  =/ux 
-2.598 1  ay,  x2  =jux  and  x3  =jux  +2.598 1  ay  using  the  Chebyshev  polynomial.  Since  a  random  variable  X  may  not  be 
evenly  distributed  in  its  domain,  providing  uniform  error  does  not  necessarily  mean  that  the  calculated  partial  output 
variance  is  accurate.  However,  since  no  unique  location  of  samples  is  best  for  accurate  partial  output  variance,  the 
sample  location  by  the  Chebyshev  polynomial  is  used  in  this  paper  due  to  the  fact  that  it  yields  reasonable  results  for 
various  distribution  types  of  the  random  variable  X.  If  the  random  variable  X  has  a  closed  and  bounded  domain  like 
[a,  Z>],  the  domain  can  be  directly  used  for  calculation  of  partial  output  variance,  and  the  location  of  samples  are 
X\  =  0.93301a  +  0.06699 b,  x2  =  (a  +  b)/2  and  x3  =  0.06699a  +  0.9330 16,  using  Chebyshev  polynomials. 

To  check  the  performance  of  a  selected  location  of  samples,  a  nonlinear  performance  measure  Y  is  used  as 

Y(X)  =  0.3  +  sin(l  6X  / 15  -  0.7)  +  sin2  (16X  / 15  -  0.7)  (24) 

Assuming  that  random  variable  X  follows  N( 0.5,  0.3332),  three  locations  of  samples  are  chosen  to  compare  the 
accuracy  of  the  partial  output  variance.  The  first  location  is  {0.167,  0.5,  0.833},  which  is  jux  and  jUx±ax  and  the 
second  location  is  from  the  Chebyshev  polynomial  as  {-0.365,  0.5,  1.365}.  The  third  location  is  wider,  as  {-0.667, 
0.5,  1.667},  which  is  jux  and  jux±3.5crx.  Partial  output  variances  are  calculated  using  100,000  realizations  of  A,  and 
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true  partial  output  variance  is  calculated  by  Eq.  (24)  with  the  same  realizations.  To  check  the  accuracy  of  the 
quadratic  interpolation  itself,  mean  square  error  (MSE)  is  calculated  in  the  region  of  (-0.5,  1.5),  which  is  (jux~3ax, 
Hx  +3  <7X)  with  100  uniformly  distributed  points.  The  calculated  result  is  shown  in  Table  1,  and  the  shape  of 
quadratic  interpolations  is  shown  in  Fig.  2,  where  asterisk  marks  (*)  represent  the  sample  point.  As  shown  in  Table 
1,  the  location  of  the  sample  using  Chebyshev  polynomials  produces  more  accurate  partial  output  variance 
compared  to  the  true  one  and  less  MSE  than  the  other  cases. 


Table  1.  Quadratic  Interpolation  with  Different  Sample  Locations 


Quadratic  interpolation 

Location  of  samples 

{0.167,  0.5,0.833} 

{-0.365,0.5,  1.365} 

{-0.667,0.5,  1.667} 

Partial  Value 

8.74E-02 

7.84E-02 

6.92E-02 

7.84E-02 

output 

variance  Accuracy 

111.5% 

100.0% 

88.3% 

100.00% 

MSE 

1.17E-02 

1.19E-03 

9.00E-03 

0 

(b)  Location  of  sample:  {-0.365,  0.5,  1.365} 


(c)  Location  of  sample:  {-0.667,  0.5,  1.667} 


Figure  2.  Quadratic  Interpolation  of  Y  with  Different  Locations  of  Samples 

This  example  cannot  represent  all  performance  measures.  When  a  highly  nonlinear  performance  measure  is 
expected,  more  sophisticated  surrogate  methods,  such  as  the  RBF,  PRS,  SVR,  Kriging,  and  DKG  [2-10]  methods, 
are  better.  In  any  case,  it  is  recommended  to  sample  inside  the  region  of  (jux~ 3  ax,  jUx+ 3  ax)  for  the  random  variable 
X  if  the  distribution  has  an  infinite  domain. 

4.  Numerical  Examples 
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Analytical  examples  and  an  engineering  example  are  used  to  test  the  performance  of  the  proposed  variable 
screening  method.  Partial  output  variances  are  calculated  to  select  important  variables  using  the  1-D  quadratic 
interpolation  presented  in  Section  3.3.  To  use  the  variable  screening  method,  five  parameters:  significance  level  a, 
false  negative  rate  /?,  number  of  sample  ns,  initial  number  of  repetition  nr0  and  control  parameter  y  for  threshold 
value,  need  to  be  decided  by  users.  Smaller  a  and  ft  are  better  choices  because  they  result  in  less  statistical  errors  in 
the  variable  screening  method.  However,  when  they  are  too  small,  very  large  nr  could  be  required  to  maintain  the 
error  level  specified  by  a  and  ft  in  Eq.  (23).  Hence,  0.025  to  0.05  would  be  a  reasonable  choice  for  them.  For  nr0 
and  ns,  a  small  number  could  be  chosen  to  reduce  computational  cost.  However,  a  small  value  of  nr„  and  ns  will 
rapidly  increase  nr  to  maintain  the  error  level.  Hence,  an  appropriately  large  number  should  be  used;  they  are  set  to 
be  50  or  100  in  the  numerical  examples.  The  parameter  y  in  Eq.  (22)  is  for  user  control  of  the  threshold  value  that 
determines  important  variables.  In  the  numerical  example,  y  is  initially  set  to  1.0,  and  the  variable  screening 
procedure  is  performed.  Then,  the  ratio  of  the  sum  of  partial  output  variances  of  selected  variables  to  that  of  all 
random  variables,  which  is  an  estimation  of  the  captured  output  variance,  is  checked.  If  the  ratio  is  less  than  85%,  y 
is  lowered  to  achieve  85%.  As  explained  before,  the  partial  output  variance  is  an  approximation  method,  which  is 
why  the  ratio  of  85%  may  not  mean  that  85%  of  total  output  variance  is  actually  captured  in  the  selected  variables. 
However,  it  would  be  a  good  estimation  with  affordable  cost  because  it  does  not  require  many  design  of  experiments 
(DoE)  samples  or  full-dimensional  surrogate  models. 

4. 1  Analytical  Examples 

Hartmann  6-D  and  Dixon-Price  12-D  are  well-known  analytical  functions.  They  are  high-dimensional  as  well 
as  nonlinear,  so  they  are  tested  for  the  variable  screening  method.  Constant  terms  are  added  to  the  original  functions 
to  make  both  functions  active  (i.e.  G(X)  =  0)  at  the  mean  point  of  the  input  random  variables.  Note  that  adding  a 
constant  term  does  not  change  the  character  of  the  functions.  Input  random  variables  have  a  variety  of  marginal 
distribution  types  and  copula  types,  so  the  analytical  examples  can  reveal  the  effects  of  different  distribution  types 
and  correlations. 

The  functions  are  tested  with  three  different  methods.  The  first  is  the  variable  screening  method.  As  mentioned 
before,  parameters  of  a  =  ft  =  0.025,  nr0  =  ns  =  100  and  y  =  1.0,  and  1-D  surrogate  model  with  quadratic 
interpolation  are  used,  y  is  initially  set  to  1 .0  and  lowered  when  necessary.  The  second  is  screening  with  accurate 
partial  output  variances  using  the  analytical  functions  directly  and  1,000,000  realizations  of  random  variables.  The 
calculated  partial  output  variances  are  used  as  reference.  When  a  performance  measure  is  a  linear  function  (G  = 
X aXd  of  the  input  random  variables  X-' s,  the  output  variance  is  'Lol]g1x  ,  where  a1  is  variance  Xt.  Hence,  the 

partial  output  variance  can  be  linearly  approximated  as  a]a\  with  a  design  sensitivity  (gradient)  a,  and  the  input 
variance  <jlx  ■  Furthermore,  an  important  variable  might  be  selected  based  on  the  partial  output  variances  calculated 
with  the  sensitivity-variance  method,  and  it  is  applied  to  the  analytic  functions  for  comparison. 

4.1.1  Hartmann  6-D 
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The  first  analytical  example  is  the  Hartmann  6-D,  and  a  constant  term  is  added  as  explained  before.  The 
analytical  expression  is  shown  as  [37] 


where  0  <Xt  <  1,  m  =  6,  q  =  4  and 


G(x)  =  -£  ai  exP  bu 


+  3.3082 


1=1 

V 

j= i 

/ 

a 

=  [1.0 

1.2 

3.0 

3.2] 

10.0 

3.0 

17.0 

3.5 

1.7 

8.0 

0.05 

10.0 

17.0 

0.1 

8.0 

14.0 

3.0 

3.5 

1.7 

10.0 

17.0 

8.0 

17.0 

8.0 

0.05 

10.0 

0.1 

14.0 

0.1312 

0.1696 

0.5569 

0.2329 

0.4135 

0.8307 

0.2348 

0.1451 

0.3522 

0.4047 

0.8828 

0.8732 

0.0124 

0.8283 

0.5886 

0.3736 

0.1004 

0.9991 

0.2883 

0.3047 

0.6650 

0.5743 

0.1091 

0.0381 

(25) 


(26) 


(27) 


(28) 


Information  about  input  random  variables  is  listed  in  Table  2.  Input  random  variables  have  four  different  marginal 
distribution  types:  normal,  lognormal,  gamma,  and  Weibull.  X5  and  X(  are  correlated  with  the  Clayton  copula  and 
Kendall’s  tau  of  0.5. 


Table  2.  Input  Random  Variables  for  Hartmann  6-D  Example 


Random 

Distribution 

Mean 

STDEV 

Correlation 

Variable 

Type 

(Copula) 

V, 

Normal 

0.20 

0.009 

- 

x2 

Lognormal 

0.15 

0.007 

- 

x 3 

Gamma 

0.48 

0.015 

- 

X4 

Weibull 

0.28 

0.014 

- 

X 5 

Normal 

0.31 

0.014 

Clayton 

V6 

Weibull 

0.68 

0.013 

t=  0.5 

The  result  of  variable  screening  is  shown  in  Table  3.  The  design  sensitivity  (gradient)  of  the  Hartmann  6-D  in 
Eq.  (25)  at  the  input  mean  point  is  shown  in  the  second  column,  and  the  third  through  fifth  columns  show  partial 
output  variances  using  the  sensitivity-variance  method,  the  variable  screening  method,  and  the  accurate  method, 
respectively.  As  the  variable  screening  method  calculates  partial  output  variances  nr  times,  the  result  is  the  mean 
value  of  the  calculated  partial  output  variances.  In  each  method,  important  variables  are  marked  with  bold  font.  It 
can  be  seen  that  the  variable  screening  method  finds  the  same  variables  as  the  accurate  method,  whereas  the 
sensitivity-variance  method  misses  X4  and  X5.  The  sixth  and  seventh  columns  are  the  ratios  of  partial  output 
variances  using  the  sensitivity-variance  method  and  the  variable  screening  method  to  the  accurate  partial  output 
variances,  respectively.  It  is  evident  that  the  sensitivity-variance  method  cannot  estimate  the  partial  output  variances 
accurately,  while  the  variable  screening  method  does.  Overall,  the  variable-screening  method  outperforms  the 
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sensitivity-variance  method.  Hence,  it  is  better  to  use  at  least  quadratic  approximation  for  the  1-D  surrogate  model 
to  calculate  partial  output  variances. 


Table  3.  Partial  Output  Variances  of  Hartman  6-D  Example 


Random 

Variable 

Design 

Sensitivity 

Sensitivity 

-Variance 

Variable 

Screening 

Accurate 

Sens.-Var. 

/Accurate 

Variable  Scr. 
/Accurate 

-5.88E-02 

2.80E-07 

2.24E-06 

2.25E-06 

12.4% 

99.6% 

v2 

-1.36E-02 

9.04E-09 

6.12E-07 

6.17E-07 

1.5% 

99.2% 

V, 

7.15E-02 

1.15E-06 

1.1  IE— 05 

1.07E-05 

10.7% 

103.7% 

v4 

2.63E-01 

1.35E-05 

8.88E-05 

8.67E-05 

15.6% 

102.4% 

v5 

-1.36E-01 

3.61E-06 

1.86E-04 

1.86E-04 

1.9% 

100.0% 

^6 

1.17E+00 

2.29E-04 

1.65E-04 

1.66E-04 

138.0% 

99.4% 

Selected 

2.29E-04 

4.40E-04 

4.39E-04 

- 

- 

All 

2.48E-04 

4.54E-04 

4.52E-04 

- 

- 

Selected  /  All 

92.3% 

96.9% 

97.1% 

- 

- 

Criterion  (ju0) 

4.13E-05 

7.42E-05 

7.54E-05 

- 

- 

In  Table  3,  the  bottom  four  rows  show  more  information  about  each  method.  The  first  and  second  rows  are  the 
sums  of  partial  output  variances  of  selected  variables  and  all  variables,  respectively.  The  third  row  shows  the  ratio 
of  the  first  row  to  the  second  row.  The  last  row  is  jUq,  which  is  the  criterion  used  to  select  important  variables.  In  the 
variable  screening  method,  the  important  variables  (X4,  X5  and  X6)  are  determined  to  be  larger  than  //,,  by  hypothesis 
testing.  The  sensitivity-variance  method  and  the  accurate  method  select  important  variables  if  the  variable  has 
partial  output  variance  larger  than  /j0.  In  the  third  row,  the  ratio  for  the  variable  screening  method  is  larger  than  85%, 
so  y  is  the  initial  value  of  1.0,  and  equivalent  /j0  is  applied  for  other  methods.  The  sensitivity-variance  method 
estimates  that  92.3%  of  output  variance  is  contained  in  X6  only.  This  is  a  very  poor  estimation,  as  only  36.7% 
(=1.66E-04/4.52E-04)  of  output  variance  is  captured  in  X6  according  to  the  result  of  the  accurate  method.  On  the 
other  hand,  the  variable  screening  method  estimates  that  96.9%  of  output  variance  is  contained  in  X4,  X5  and  X6,  and 
this  is  very  accurate  compared  to  the  97.1%  determined  by  the  accurate  method. 

However,  the  total  and  captured  output  variances  in  Table  3  are  approximation  using  the  partial  output 
variances  and  the  correlation  term  is  not  considered  as  explained  in  Section  3.1.  Having  the  analytical  expression  of 
the  Hartman  6-D  example  in  Eq.  (25),  true  total  output  variance  induced  by  multiple  input  random  variables  can  be 
calculated  as  well.  The  true  total  output  variance  is  calculated  using  1,000,000  realizations  of  all  input  random 
variables  and  the  calculated  value  is  4.23E-04  as  shown  in  Table  4.  Recalling  the  approximated  result  in  Table  3, 
the  variable  screening  method  (4.54E-04)  and  the  accurate  method  (4.52E-04)  well  approximate  the  true  total 
output  variance,  whereas  the  sensitivity-variance  method  (2.48E-04)  is  not  able  to  do  so.  In  Table  4,  true  captured 
output  variance  by  the  selected  variables  is  also  calculated.  To  calculate  the  true  captured  output  variance,  the 
realizations,  which  are  generated  to  calculate  the  true  total  output  variances,  are  used.  Among  them,  the  realizations 
of  the  screened-out  variables  are  fixed  at  their  mean  values.  Then,  the  variance  of  Hartman  6-D  is  calculated  using 
the  modified  realizations.  The  true  captured  output  variance  in  X6  is  1 .66E-04,  which  is  the  same  as  the  partial 
output  variance  of  X6  found  by  the  accurate  method  (see  Table  3).  Hence,  the  sensitivity- variance  method  captures 
only  39.2%  (1.66E-04/4.23E-04)  of  the  true  total  output  variance  in  its  selection  X6.  This  will  lead  a  reliability 
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problem  not  to  estimate  the  probability  of  failure  correctly.  By  contrast,  the  captured  output  variance  in  X4,  X5  and 
X6  is  4. 10E-04,  and  this  is  96.9%  (4. 10E-04/4.23E-04)  of  the  total  output  variance.  This  indicates  that  a  reliability 
problem  could  be  solved  accurately  utilizing  X4,  Xs  and  X6.  From  this  example,  it  can  be  seen  that  the  variable 
screening  method  works  as  it  is  intended. 


Table  4.  True  Total  and  Captured  Output  Variances  of  Hartman  6-D  Example 


Sensitivity 

-Variance 

Variable  Screening 
&  Accurate 

True  Total  Output 
Variance 

Selected  Variables 

V6(l-D) 

X4,X5,X6(3-D) 

All  (6-D) 

Captured  Output  Variance 

1.66E-04 

4.10E-04 

4.23E-04 

Ratio  to  Total  Out.  Var. 

39.2% 

96.9% 

100.0% 

4.1.2  Dixon-Price  12-D 

The  second  analytical  example  is  the  Dixon-Price  12-D,  and  again  a  constant  term  is  added  to  its  original 
function.  The  analytical  expression  is  shown  as  [38] 


G(X)  =  (X,  -l)2  +YJi(2X;  -VM)2  -  3.5575  xlO"3  (29) 

i-2 

where  -10  <  xt  <  10,  i  =  1,  2,  ...,  m  and  m  =  12.  Input  random  variables  shown  in  Table  5  are  used  for  the  test. 
They  have  five  different  marginal  distribution  types  of  normal,  lognormal,  Weibull,  Gumbel,  and  Gamma.  X\  and 
X2  are  correlated  with  the  Frank  copula  and  Kendall’s  tau  of  0.7.  Also,  X5  and  X6  are  correlated  with  the  FGM 
copula  and  Kendall’s  tau  of  0.2. 


Table  5.  Input  Random  Variables  for  Dixon-Price  12-D  Example 


Random 

Distribution 

Mean 

STDEV 

Correlation 

Variable 

Type 

(Copula) 

Normal 

1.00 

0.025 

Frank 

Normal 

0.71 

0.02 

II 

o 

x% 

Lognormal 

0.59 

0.02 

- 

x4 

Lognormal 

0.55 

0.02 

- 

X 5 

Weibull 

0.52 

0.02 

FGM 

X(, 

Weibull 

0.51 

0.02 

<N 

O 

II 

Xn 

Gumbel 

0.51 

0.015 

- 

x. 

Gumbel 

0.50 

0.015 

- 

x9 

Normal 

0.50 

0.015 

- 

xw 

Normal 

0.50 

0.01 

- 

Xu 

Gamma 

0.50 

0.01 

- 

Xn 

Gamma 

0.50 

0.015 

- 

The  test  result  of  the  Dixon-Price  12-D  example  is  shown  in  Table  6,  and  selected  variables  in  each  method  are 
marked  with  bold  font.  In  this  example,  the  value  of ;/  is  lowered  to  0.7  to  contain  at  least  85%  of  output  variance  in 
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the  selected  variables.  And  it  is  shown  that  86.6%  of  output  variance  is  estimated  using  the  variable  screening 
method  in  Table  6.  Design  sensitivities  with  respect  to  X9  ~  Xl2  are  zero,  and  accordingly  the  partial  output 
variances  of  X9~  Xu  using  the  sensitivity-variance  method  are  zero.  Hence,  the  sensitivity-variance  method  misses 
X9  and  X\2  even  though  they  have  large  partial  output  variances.  Moreover,  the  other  partial  output  variances  using 
the  sensitivity-variance  method  have  poor  accuracy  compared  to  the  accurate  method  (see  the  sixth  column  in  Table 
6).  Hence,  X2  and  X2  are  selected  instead  of  X6  even  though  X6  actually  has  larger  partial  output  variance  than  X2  and 
X^.  On  the  contrary,  the  variable  screening  method  reasonably  estimates  partial  output  variances  and  correctly 
identifies  important  variables  compared  to  the  accurate  method.  Therefore,  it  is  confirmed  that  at  least  quadratic 
approximation  is  needed  for  the  1-D  surrogate  model  to  calculate  partial  output  variances. 


Table  6.  Partial  Output  Variances  of  Dixon-Price  12-D  Example 


Random 

Variable 

Design 

Sensitivity 

Sensitivity 

-Variance 

Variable 

Screening 

Accurate 

Sens.-Var. 

/Accurate 

Variable  Scr. 
/Accurate 

-3.28E-02 

6.72E-07 

7.95E-06 

7.71E-06 

8.7% 

103.1% 

v2 

1.76E-01 

1.24E-05 

1.41E-04 

1.35E-04 

9.2% 

104.4% 

v3 

— 3.15E— 01 

3.98E-05 

1.56E-04 

1.62E-04 

24.6% 

96.3% 

Xg 

3.56E-01 

5.07E-05 

2.95E-04 

2.81E-04 

18.0% 

105.0% 

X5 

-1.94E-01 

1.50E-05 

4.98E-04 

4.45E-04 

3.4% 

111.9% 

X6 

-1.38E-01 

7.61E-06 

6.22E-04 

5.51E-04 

1.4% 

112.9% 

X~] 

4.51E-01 

4.58E-05 

5.27E-04 

5.57E-04 

8.2% 

94.6% 

X, 

-3.20E-01 

2.30E-05 

3.10E-04 

3.55E-04 

6.5% 

87.3% 

X9 

0.00E+00 

0.00E+00 

2.20E-04 

2.15E-04 

0.0% 

102.3% 

Xw 

0.00E+00 

0.00E+00 

5.31E-05 

5.25E-05 

0.0% 

101.1% 

Xu 

0.00E+00 

0.00E+00 

6.42E-05 

6.32E-05 

0.0% 

101.6% 

X\  2 

0.00E+00 

0.00E+00 

2.45E-04 

2.39E-04 

0.0% 

102.5% 

Selected 

1.87E-04 

2.72E-03 

2.64E-03 

- 

- 

All 

1.95E-04 

3.14E-03 

3.06E-03 

- 

- 

Selected  /  All 

95.9% 

86.6% 

86.3% 

- 

- 

Criterion  ( //0) 

1.14E-05 

1.68E-04 

1.79E-04 

- 

- 

Using  Eq.  (29),  the  true  total  and  captured  output  variances  of  Dixon-Price  12-D  example  are  calculated  as 
shown  in  Table  7.  In  Table  6,  the  variable  screening  (3.14E-03)  and  the  accurate  methods  (3.06E-03)  reasonably 
approximate  the  true  total  output  variance  (3.30E-03  in  Table  7),  while  the  sensitivity-variance  method  (1.95E-04) 
cannot.  In  Table  7,  the  true  captured  output  variance  by  selected  variables  using  the  sensitivity-variance  method  is 
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only  2.15E-03,  which  is  65.2%  of  the  true  total  output  variance.  By  contrast,  the  output  variance  of  2.80E-03  is 
contained  in  X4  ~  X9  and  Xl2,  which  indicates  that  84.8%  of  the  true  total  output  variance  is  captured.  Hence,  it  is 
verified  that  the  variable  screening  method  correctly  finds  the  important  variables  of  the  Dixon-Price  12-D  example. 
Through  analytical  examples,  it  is  shown  that  the  partial  output  variance  is  a  well-performing  measure  for  variable 
screening  purposes,  and  the  proposed  variable  screening  method  successfully  finds  important  variables  as  it  is 
intended. 


Table  7.  True  Total  and  Captured  Output  Variances  of  Dixon-Price  12-D  Example 


Sensitivity 

-Variance 

Variable  Screening 
&  Accurate 

True  Total  Output 
Variance 

Selected  Variables 

X2~X5,X7,X8(6-D) 

X4~X9,X12(7-D) 

All  (12-D) 

Captured  Output  Variance 

2.15E-03 

2.80E-03 

3.30E-03 

Ratio  to  Total  Out.  Var. 

65.2% 

84.8% 

100.0% 

4.2  Engineering  Example 

A  car  noise,  vibration,  and  harshness  (NVH)  and  crash  safety  problem  is  considered  to  demonstrate  the 
performance  and  efficiency  of  the  proposed  method.  The  problem  includes  full  frontal  impact,  40%  offset  frontal 
impact,  and  NVH  as  constraints.  There  are  a  total  of  11  performance  measures  as  shown  in  Table  8:  nine  safety 
measures  and  two  NVH  measures. 


Table  8.  Performance  Measure  Description 


Mode 

Function 

Value 

Feasibility  decision 

Safety 

Full  frontal 
impact 

G, 

Chest  G 

<  Baselinet 

g2 

Crush  displacement 

40%  offset 
impact 

g3 

Brake  pedal 

g4 

Footrest 

g5 

Left  toepan 

g6 

Center  toepan 

g7 

Right  toepan 

G* 

Left  IP 

g9 

Right  IP 

NVH 

G,„ 

Torsion  mode 

G„ 

Vertical  bending  mode 

In  this  example,  it  is  assumed  that  the  only  source  of  uncertainty  is  the  thickness  of  the  body  plates.  The  44 
random  variables  shown  in  Table  9  are  used  to  represent  the  thicknesses.  All  random  variables  follow  normal 
distribution  and  are  statistically  independent.  The  design  variable  vector  d  is  the  mean  vector  of  the  44  random 
variables,  and  there  is  no  random  parameter  in  this  example.  Among  those  random  variables,  six  random  variables 
(V|  to  X5  and  V8)  are  common  variables  for  both  safety  and  NVH  measures,  two  (X6  and  X7)  are  variables  only  for 
safety,  and  the  other  36  random  variables  are  only  for  NVH  measures. 


Table  9.  Input  Random  Variables 


RVs 

Dist.  Type 

d 

STDV 

dL 

RVs 

Dist.  Type 

d 

STDV 

dL 

V! 

Normal 

1.9 

0.05 

1.5 

2.3 

XNt 

Normal 

0.9 

0.03 

0.7 

it 

18 


x2 

Normal 

1.91 

0.05 

1.5 

2.3 

xn2 

Nonnal 

1.1 

0.03 

0.8 

1.4 

A3 

Normal 

2.51 

0.06 

2.0 

3.0 

xn3 

Nonnal 

1.55 

0.05 

1.2 

1.9 

a4 

Normal 

2.4 

0.06 

1.9 

2.9 

XN4 

Nonnal 

0.9 

0.03 

0.7 

1.1 

a5 

Normal 

2.55 

0.06 

2.0 

3.1 

XN5 

Nonnal 

1.5 

0.03 

1.2 

1.8 

a6 

Normal 

2.25 

0.06 

1.8 

2.7 

xn6 

Nonnal 

1.2 

0.03 

0.9 

1.5 

Xj 

Normal 

2.25 

0.06 

1.8 

2.7 

XN-, 

Nonnal 

1.1 

0.03 

0.8 

1.4 

X 8 

Normal 

1.5 

0.03 

1.2 

1.8 

XNS 

Nonnal 

1.52 

0.05 

1.2 

1.9 

Xio 

Normal 

1.28 

0.03 

0.9 

1.6 

XNt) 

Nonnal 

0.8 

0.03 

0.6 

1.0 

Xu 

Normal 

1.4 

0.03 

1.0 

1.8 

XNl0 

Nonnal 

0.8 

0.03 

0.6 

1.0 

X\2 

Normal 

1.1 

0.03 

0.8 

1.4 

XNU 

Nonnal 

1.2 

0.03 

0.9 

1.5 

X13 

Normal 

2.2 

0.06 

1.7 

2.7 

XNl2 

Nonnal 

0.75 

0.03 

0.6 

0.9 

X\4 

Normal 

1.5 

0.03 

1.2 

1.8 

xn13 

Nonnal 

0.75 

0.03 

0.6 

0.9 

XI5 

Normal 

1.25 

0.03 

0.9 

1.6 

xn14 

Nonnal 

0.75 

0.03 

0.6 

0.9 

Xl6 

Normal 

2.5 

0.06 

2.0 

3.0 

xn15 

Nonnal 

1.0 

0.03 

0.8 

1.2 

XI7 

Normal 

2.0 

0.05 

1.5 

2.5 

XNl6 

Nonnal 

1.14 

0.03 

0.9 

1.4 

X\g 

Normal 

1.4 

0.03 

1.1 

1.7 

XN„ 

Nonnal 

1.2 

0.03 

0.9 

1.5 

X20 

Normal 

1.22 

0.03 

0.9 

1.5 

XNlg 

Nonnal 

1.4 

0.03 

1.1 

1.7 

X23 

Normal 

0.75 

0.03 

0.6 

1.0 

XNlg 

Nonnal 

1.2 

0.03 

0.9 

1.5 

X24 

Normal 

1.9 

0.05 

1.5 

2.3 

XN20 

Nonnal 

1.4 

0.03 

1.1 

1.7 

X25 

Normal 

0.65 

0.03 

0.5 

0.8 

XN2l 

Nonnal 

2.13 

0.06 

1.7 

2.6 

X26 

Normal 

0.85 

0.03 

0.6 

1.1 

X21 

Normal 

0.85 

0.03 

0.6 

1.1 

This  problem  requires  three  and  a  half  hours  for  the  impact  dynamic  analysis  for  crash  safety  and  the  modal 
analysis  for  NVH.  Thus,  the  actual  analysis  takes  too  much  time  to  test  the  proposed  method  thoroughly.  Ford 
Motor  Company  provided  full-dimensional  global  (considering  the  entire  design  domain)  surrogate  models  so  that 
we  can  use  them  to  demonstrate  the  proposed  method  of  variable  screening.  The  full-dimensional  surrogate  models 
may  not  be  accurate,  since  44-D  is  too  high  to  create  accurate  surrogate  models,  especially  for  RBDO.  However,  to 
test  the  proposed  method  of  variable  screening,  the  responses  from  the  44-D  global  surrogate  models  are  treated  as 
true  responses  in  this  paper.  The  maximum  dimension  at  which  accurate  surrogate  models  can  be  generated  depends 
on  the  computational  power  and  nonlinearity  of  a  given  problem.  In  this  paper,  the  DKG  method  [10]  is  used  to 
generate  an  accurate  surrogate  model  using  the  Iowa  Reliability-Based  Design  Optimization  (I-RBDO)  code  [39], 
and  18-D  is  targeted  as  the  maximum  degrees  of  freedom  of  DKG  models.  The  I-RBDO  code  is  also  used  to  carry 
out  RBDO  in  this  paper. 


4.2.1  Variable  Screening 

At  the  baseline  design  d,  which  is  the  initial  design  as  shown  in  Table  9,  all  11  performance  measures  in  Table 
8  are  active.  That  is,  the  value  of  every  performance  measure  at  the  baseline  design  is  the  same  as  the  baseline 
values,  with  GrBaseline,,  r=l — 1 1 .  Therefore,  the  proposed  variable  screening  method  is  performed  for  the  problem 
at  the  baseline  design.  Parameters  ns,  nr0,  a,  p,  and  y  axo  set  as  50,  50,  0.05,  0.05,  and  1.0,  respectively.  Four 
hundred  eighty-four  (44  design  variables  x  11  performance  measures)  1-D  surrogate  models  with  quadratic 
interpolation  are  generated  using  89  DoE  samples  (i.e.,  simulation  samples).  It  is  noted  that  11  values  of 
performance  measures  are  obtained  from  one  analysis.  The  results  of  partial  output  variances  vt  are  listed  in  Table 
10  and  Table  11  for  every  performance  measure.  The  partial  output  variances  of  the  important  variables  for  each 
performance  measure  are  marked  with  bold  font.  It  is  noted  that  only  partial  output  variances  of  Xi~X8  are  listed  in 
Table  10  since  Gi~Gg  are  only  a  function  of  X\~X8  as  the  variable  screening  method  identified  the  partial  output 
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variances  to  be  zero  for  other  random  variables.  In  Table  1 1,  X6  and  X7  have  zero  partial  output  variances  as  GH)  and 
G 11  are  not  functions  of  X(,  and  X7.  In  the  last  three  rows  of  Table  10  and  Table  11,  the  sums  of  partial  output 
variances  of  selected  variables,  the  sums  of  all  partial  output  variances,  and  their  ratios  are  listed.  As  explained 
before,  this  is  the  estimated  ratio  between  the  captured  output  variance  in  selected  variables  to  the  total  output 
variance.  It  is  estimated  that  a  minimum  of  90.3%  of  the  total  output  variance  is  captured  in  the  selected  variables. 
In  total,  14  random  variables:  Xu  X2,  X3,  X4,  X5,  X6,  X7,  Xg,  Xi0,  X2o,  X23,  X25,  X26,  and  XNi,  are  selected  as  important 
variables.  Accordingly,  14  design  variables,  which  are  the  means  of  the  selected  random  variables,  are  considered 
as  important  design  variables. 


Table  10.  Partial  Output  Variances  v;  (G\~G9) 


RVs 

G] 

g2 

g3 

g4 

g5 

g6 

g7 

g8 

g9 

x, 

2.37E-02 

3.75E+00 

4.06E+01 

1.11E+01 

1.94E+00 

1.73E+00 

6.21E-01 

3.45E-01 

5.75E-01 

x2 

1.88E-02 

1.01E+00 

3.55E-01 

1.86E+00 

2.07E+00 

2.06E+00 

1.09E+00 

4.03E-01 

2.69E-01 

X, 

5.08E-05 

1.76E-02 

1.05E+01 

9.74E+00 

5.95E+00 

4.38E+00 

2.00E+00 

2.37E-01 

2.24E-02 

X4 

1.66E-04 

9.27E-02 

3.91E+00 

1.79E+00 

2.65E-01 

6.65E-02 

1.73E-03 

4.78E-03 

9.65E-03 

X 

1.12E-04 

1.37E-04 

2.74E+00 

2.23E+00 

4.90E-01 

9.11E-01 

9.30E-01 

1.79E-01 

7.94E-02 

4.58E-05 

2.77E+00 

1.64E-01 

8.14E-02 

8.18E-01 

1.60E-01 

1.70E-01 

3.95E-02 

3.27E-02 

X7 

2.32E-03 

1.34E-01 

1.91E-01 

1.50E-01 

2.75E+00 

1.71E+00 

5.66E-01 

1.10E-01 

2.77E-01 

X 

1.23E-03 

6.28E-02 

4.39E-02 

3.02E-01 

5.35E-01 

5.45E-01 

3.27E-01 

6.25E-02 

4.87E-02 

Selected 

4.6  IE— 02 

7.53E+00 

5.78E+01 

2.67E+01 

1.46E+01 

1.13E+01 

5.70E+00 

1.38E+00 

1.28E+00 

All 

4.64E-02 

7.84E+00 

5.85E+01 

2.73E+01 

1.48E+01 

1.16E+01 

5.71E+00 

1.38E+00 

1.31E+00 

Selec./All 

99.4% 

96.0% 

98.8% 

97.8% 

98.6% 

97.4% 

99.8% 

100.0% 

97.7% 

Table  11.  Partial  Output  Variances  vj.  (Gi0andGn) 


RVs 

G10 

G„ 

RVs 

G10 

Gn 

RVs 

G10 

Gn 

1  v, 

1.30E-03 

1.48E-03 

Xn 

4.24E-07 

2.01E-04 

XN g 

2.06E-05 

5.45E-05 

x2 

4.39E-05 

3.44E-05 

Xlg 

9.71E-06 

3.33E-05 

XNg 

1.61E-06 

1.89E-05 

V, 

8.95E-04 

3.05E-04 

Xyt 

2.21E-03 

1.84E-03 

XNl0 

5.11E-06 

1.83E-06 

x4 

8.74E-05 

2.63E-05 

Xn 

2.67E-03 

1.13E-03 

XNn 

1.56E-07 

1.12E— 07 

V5 

3.06E-04 

4.23E-05 

X24 

2.35E-05 

6.89E-05 

XNn 

1.83E-06 

3.10E-05 

V6 

0 

0 

x24 

4.85E-04 

7.31E-03 

XNn 

1.69E-07 

8.19E-07 

x7 

0 

0 

X26 

4.59E-03 

1.06E-02 

XNl4 

9.08E-08 

3.75E-05 

X, 

5.56E-04 

1.89E-04 

x27 

3.43E-05 

3.76E-04 

XNn 

1.84E-06 

1.66E-05 

Xw 

5.49E-04 

3.87E-04 

XNi 

9.33E-05 

6.38E-03 

XNX6 

4.35E-08 

2.88E-08 

Xn 

3.32E-05 

2.05E-04 

xn2 

4.49E-07 

7.48E-05 

XNn 

3.24E-07 

1.84E-06 

X\2 

9.35E-05 

3.98E-04 

xn3 

9.59E-08 

2.00E-05 

XNlg 

1.04E-07 

1.18E-07 

Xn 

7.23E-06 

3.82E-04 

xn4 

6.04E-08 

8.36E-05 

XNn 

3.03E-08 

3.37E-07 

Xu 

5.76E-07 

1.17E-05 

xn5 

2.60E-07 

1.42E-07 

XN2o 

2.06E-07 

2.98E-06 

Xl5 

1.16E-07 

1.13E-05 

xn6 

1.29E-07 

2.22E-07 

xn21 

2. 44  E— 07 

9.41E-06 

Xu 

9.14E-08 

1.63E-06 

xn7 

9.46E-08 

7.94E-06 

Selected 

1.33E-02 

2.87E-02 

All 

1.40E-02 

3.18E-02 

Selec./All 

95.0% 

90.3% 

Sensitivity-variance  introduced  in  Section  4. 1  is  applied  to  performance  measure  G5,  and  the  result  is  shown  in 
Table  12.  Selected  random  variables  for  G5  are  X\~X3  and  X5~Xg  out  of X\~Xg.  Among  Xi~Xs,  variables  X3~X7  have 
the  largest  standard  deviation  of  0.06.  However,  X4  is  not  selected  among  them  because  it  has  small  design 
sensitivity  compared  to  others.  Here,  the  design  sensitivities  are  calculated  at  the  design  point  using  the  forward 
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finite  difference  method  (FDM)  with  0.1%  perturbation.  By  contrast,  X%  is  selected  as  an  important  variable  even 
though  it  has  smallest  standard  deviation  of  0.03.  Again,  this  is  because  it  has  relatively  large  sensitivity  and 
induces  large  output  variances. 


Table  12.  Result  of  Sensitivity-Variance  Method  in  G5 


RVs 

STDEV 

(°v, ) 

Design  Sensitivity 

(«,) 

Partial  Output  Variance 

(«M,) 

v, 

0.05 

27.9105 

1.95E+00 

*2 

0.05 

29.2723 

2.14E+00 

*3 

0.06 

-40.7490 

5.98E+00 

X, 

0.06 

8.7917 

2.78E-01 

*5 

0.06 

-11.6078 

4.85E-01 

*6 

0.06 

14.1289 

7.19E-01 

X~] 

0.06 

27.8533 

2.79E+00 

Vs 

0.03 

24.3000 

5.31E-01 

Interestingly,  the  partial  output  variances  using  the  sensitivity-variance  method  of  G$  shown  in  Table  12  are 
close  to  the  result  shown  in  the  sixth  column  of  Table  10.  In  fact,  the  same  variables  as  those  in  the  variable 
screening  method  will  be  selected  by  using  the  sensitivity-variance  method  throughout  all  1 1  constraints.  However, 
the  sensitivity-variance  method  has  the  possibility  of  choosing  undesirable  variables  as  shown  in  the  analytical 
examples  in  Section  4.1.  In  Fig.  3,  the  shape  of  G5  when  each  V,  is  random  is  shown.  It  is  easily  anticipated  that  the 
design  sensitivity  of  G5  with  respect  to  X,  could  be  very  small  or  even  zero  so  that  the  sensitivity-variance  method 
may  provide  inaccurate  partial  output  variance;  this  can  be  prevented  if  the  variable  screening  method  is  used. 


Figure  3.  Shape  of  G5  when  Each  X,  is  Random 
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The  sensitivity-variance  method  requires  accurate  design  sensitivity.  In  practical  engineering  problems,  design 
sensitivity  might  be  calculated  using  FDM.  To  use  FDM,  a  user  determines  perturbation  method  (forward, 
backward  or  central)  and  perturbation  size,  and  the  result  of  the  sensitivity-variance  could  depend  on  the  user’s 
choice.  In  Table  13,  partial  output  variance  of  30  in  G(,  at  DDO  optimum  design  with  various  methods  is  shown.  It 
can  be  seen  that  X2  is  not  selected  as  an  important  variable  when  design  sensitivity  is  calculated  using  forward  FDM 
with  1%  perturbation.  The  partial  output  variance  is  only  56.6%  of  that  found  using  the  accurate  method  with 
100,000  realizations  of  X2.  To  obtain  more  accurate  design  sensitivity  with  forward  FDM,  small  perturbation  is 
required  as  shown  in  Table  13.  However,  small  perturbation  does  not  always  provide  accurate  design  sensitivity, 
and  determining  appropriate  perturbation  size  would  require  extra  DoE  samples.  Central  FDM  provides  more 
accurate  design  sensitivity,  and  it  is  insensitive  to  perturbation  size.  However,  the  partial  output  variance  with 
central  FDM  sensitivity  shows  at  most  76.7%  accuracy  compared  to  the  accurate  method.  It  is  noted  that  the 
variable  screening  method  would  not  require  perturbation  size  determination.  Moreover,  a  user  can  perform  the 
proposed  variable  screening  method  using  only  one  more  DoE  sample  than  the  sensitivity-variance  method  with 
central  FDM. 


Table  13.  Partial  Output  Variance  ofX2  in  G(,  at  DDO  Optimum  Design 


Method 

Perturbation 

Selection 

Design 

Sensitivity 

Partial  Output 
Variance 

Accuracy 

Forward,  1% 

No 

-5.83E+00 

8.49E-02 

56.6% 

Forward,  0.5% 

Yes 

-6.30E+00 

9.94E-02 

66.3% 

Sensitivity- 

Forward,  0.1% 

Yes 

-6.68E+00 

1.12E-01 

74.7% 

Variance 

Central,  1% 

Yes 

-6.79E+00 

1.15E-01 

76.7% 

Central,  0.5% 

Yes 

-6.78E+00 

1.15E-01 

76.7% 

Central,  0.1% 

Yes 

-6.78E+00 

1.15E-01 

76.7% 

Variable 

Screening 

- 

Yes 

- 

1.51  E — 0 1 

100.7% 

Accurate 

- 

Yes 

- 

1.50E-01 

100.0% 

Since  we  have  44-D  global  surrogate  models  for  this  example,  GSA  can  be  carried  out  to  verify  effectiveness  of 
the  proposed  method.  Among  various  GSA  methods,  the  global  sensitivity  index  method,  which  can  identify  the 
global  effect  of  the  variables  of  interest  on  the  output,  is  used  here.  The  main  strength  of  the  global  sensitivity  index 
method  is  that  it  can  find  interactions  between  all  variables  (not  statistical  correlation  between  random  variables). 
All  random  variables  are  assumed  to  follow  uniform  distribution  in  their  corresponding  design  domain  of  d;  and  d!/, 
and  global  sensitivity  indices  are  calculated  using  the  Monte  Carlo  simulation  (MCS)  method  with  1  million  MCS 
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samples  [31].  There  are  many  global  sensitivity  indices  in  this  44-D  problem;  the  total  sensitivity  index  S'ot  is  used 
for  variable  ranking  and  screening.  The  total  sensitivity  index  S'0'  is  a  “total  influence  of  /th  random  variable”  to  the 
output.  That  is,  it  indicates  the  main  effect  plus  interactions  of  the  ith  random  variable  with  other  random  variables 
[22].  The  results  are  listed  in  Table  14  and  Table  15.  To  identify  important  random  variables,  the  mean  value  of 
S'0'  is  calculated  for  each  constraint  and  the  random  variable,  which  yields  larger  S'01  than  the  mean  value,  is 
selected  as  an  important  variable  and  marked  in  bold  font  in  these  tables.  In  Table  14,  only  5“  for  X,  ~X8  are  listed 
as  S'0'  for  other  variables  are  zero.  Also,  the  sum  of  S'0'  for  the  selected  random  variables,  the  sum  of  S'0'  for  all 
random  variables,  and  their  ratios  are  listed  in  the  last  three  rows,  respectively. 


Table  14.  Global  Sensitivity  Indices  S'ot  (G,~G9) 


RVs 

G, 

g3 

g4 

Gs 

g6 

g7 

Gs 

Gg 

A, 

4.16E-01 

3.84E-01 

5.23E-01 

2.68E-01 

1.74E-01 

1.89E-01 

2.07E-01 

2.56E-01 

4.35E-01 

1  X, 

3.46E-01 

1.18E-01 

1.04E-01 

1.88E-01 

1.98E-01 

2.09E-01 

1.66E-01 

2.62E-01 

2.07E-01 

X, 

7.83E-03 

5.75E-03 

3.28E-01 

5.07E-01 

5.18E-01 

4.97E-01 

4.92E-01 

2.69E-01 

4.96E-02 

1  X4 

1.89E-02 

2.23E-02 

7.92E-02 

1.34E-01 

1.25E-01 

1.38E-01 

1.42E-01 

7.38E-02 

2.87E-02 

1  X, 

7.50E-03 

-7.22E-04 

1.40E-01 

2.10E-01 

1.53E-01 

1.76E-01 

1.78E-01 

1.90E-01 

9.10E-02 

x„ 

5.65E-02 

2.67E-01 

2.08E-02 

5.01E-02 

1.11E-01 

1.15E-01 

1.67E-01 

5.13E-02 

2.74E-02 

X7 

8.13E-02 

5.65E-03 

1.77E-02 

4.52E-02 

1.59E-01 

1.60E-01 

1.76E-01 

6.73E-02 

1.73E-01 

Xs 

3.23E-02 

2.06E-02 

4.02E-03 

2.33E-02 

3.82E-02 

4.09E-02 

5.21E-02 

7.13E-02 

6.14E-02 

Selected 

9.32E-01 

8.12E-01 

1.17E+00 

1.40E+00 

1.48E+00 

1.52E+00 

1.58E+00 

1.24E+00 

1.07E+00 

All 

9.66E-01 

8.23E-01 

1.22E+00 

1.43E+00 

1.48E+00 

1.52E+00 

1.58E+00 

1.24E+00 

1.07E+00 

Selec./All 

96.5% 

98.7% 

95.9% 

97.9% 

100.0% 

100.0% 

100.0% 

100.0% 

100.0% 

Table  15.  Global  Sensitivity  Indices  S'°'  (GioandG,,) 


RVs 

G,„ 

Gu 

RVs 

G,„ 

Gu 

RVs 

G,0 

G, , 

X, 

1.05E-01 

7.09E-02 

X\7 

3.30E-04 

1.46E-02 

XX8 

-8.85E-04 

-1.34E-03 

X2 

1.01E-03 

2.38E-03 

Xl8 

-4.77E-04 

-2.50E-04 

XNg 

8.54E-04 

-1.71E-03 

X, 

5.72E-02 

1.01E-02 

Xjo 

2.07E-01 

9.17E-02 

xx,  o 

1.28E-04 

-1.07E-04 

1  x4 

8.66E-03 

2.65E-03 

Xj3 

1.34E-01 

3.57E-02 

XX, , 

-2.72E-04 

-2.85E-05 

X5 

3.29E-02 

5.69E-03 

X24 

-1.60E-03 

-8.39E-04 

XX,  2 

-1.51E-04 

-1.94E-04 

X6 

0 

0 

X25 

1.12E-02 

9.79E-02 

XX, , 

-5.24E-05 

1.59E-04 

X7 

0 

0 

X26 

2.96E-01 

3.80E-01 

XX,4 

-3.88E-05 

-6.91E-05 

Xs 

4.73E-02 

5.89E-03 

x27 

6.04EX)3 

2.38E-02 

XX,  5 

1.20E-04 

4.59E-04 

X,„ 

5.98E-02 

1.99E-02 

XX, 

5.62E-03 

1.65E-01 

XX,6 

8.87E-05 

2.85E-04 

X,, 

3.86E-03 

1.67E-02 

xx2 

X.11E-04 

2.3  IE-03 

XX,  7 

-6.09E-05 

-3.61E-04 

X12 

1.34E-02 

2.88E-02 

xx3 

1.13E-04 

1.33E-03 

xx,8 

X.12E-05 

-9.18E-05 

X,3 

3.20E-04 

1.50E-02 

xx4 

1.54E-04 

-2.50E-03 

xx,9 

1.10E-04 

1.23E-03 

X,4 

2.24E-04 

-4.65E-04 

xn5 

-2.96E-04 

9.10E-04 

XX2  o 

-1.67E-04 

-8.38E-04 

X„ 

-1.33E-04 

1.36E-03 

XX6 

2.34E-04 

-2.32E-05 

xx2. 

6.77E-05 

-6.44E-04 

X]6 

-1.94E-04 

2.42E-03 

XN-j 

-1.61E-05 

-3.27E-04 

Selected 

9.39E-01 

8.94E-01 

All 

9.87E-01 

9.87E-01 

Selec./All 

95.1% 

90.6% 
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Using  the  global  sensitivity  index  method,  16  random  variables  are  selected  as  shown  in  Table  16.  Those  16 
random  variables  include  all  14  random  variables  selected  using  the  proposed  method  as  shown  in  Table  16. 
Moreover,  if  we  set  a  limit  of  14  random  variables  to  be  selected,  Xu  and  X27  will  not  be  selected  as  they  have  least 
S “  among  the  selected  variables  for  Gn  as  shown  in  Table  15.  Thus,  the  14  random  variables  selected  by  both 
methods  are  identical.  The  ratio  between  the  sensitivity  indices  of  selected  variables  and  all  variables  has  no 
physical  meaning.  However,  it  is  an  indicator  that  shows  how  much  variance  is  captured  by  the  selected  random 
variables.  The  results  are  quite  similar  to  those  of  the  proposed  method  as  shown  in  Tables  10  and  1 1  and  Tables  14 
and  15,  respectively.  Hence,  it  is  demonstrated  that  the  proposed  variable  screening  method  is  quite  effective  even 
though  it  does  not  require  global  surrogate  models  unlike  the  global  sensitivity  index  method. 


Table  16.  Selected  Random  Variables 


Method 

Selected  Variables 

Proposed  Method 

X\,  X2,  X3,  X4,  X 5,  X$,  X7,  Xg,  Xio,  X20, 
X22,X25,X2(,XN1  (14  RVs) 

Global  Sensitivity  Index 

Xu  x2,  X3,  X4,  X5,  X6,  X7,  Xg,  xl0,  xl2, 
X20,  X23,  X25,  X26,  X21,  XN,  (16  RVs) 

4.2.2  Reliability-based  Design  Optimization 
For  this  example,  RBDO  is  formulated  as 

minimize  Weight  (d) 

P  \  G.  (X)  >  Baseline,.  1  <  10%,  i  =  1 . 11 

subject  to  L  '  ,J  (30) 

di<d<d1',  d  e  □  NDV ,  and  X£Dm 

For  a  comparison  study,  we  considered  three  cases:  (1)  a  set  of  14  random  variables  are  selected  based  on 
experience  without  using  the  proposed  variable  screening  method,  (2)  another  set  of  14  random  variables  are 
selected  using  the  proposed  variable  screening  method  as  shown  in  Section  4.2.1,  and  (3)  in  addition  to  the  14 
random  variables  selected  in  case  2,  four  more  random  variables  are  selected  using  the  cost  function  sensitivity  for  a 
total  of  1 8  design  variables  to  test  the  effectiveness  of  the  proposed  variable  screening  method  and  accuracy  of  the  I- 
RBDO  code.  The  selected  design  variables  are  listed  in  Table  17. 


Table  17.  Selected  Random  Variables  for  RBDO 


Cases 

Common  Selection 

Different  Selection 

(1 )  Based  on  Experience 

XNg,  XNl0,  XNn 

(2)  Variable  Screening 

X„X2,X3,X4,X5,X6,X7,XS, 

Xio,  X23,  X26 

(3)  Variable  Screening 
+  Cost  Function 

X20,X25,XNi  (11  RVs) 

Xio,  X23,X26 
+  XN4,XNg,XNl0,XNu 

Because  the  cost  function,  which  is  weight  in  this  problem,  is  a  function  of  design  variables  d,  not  random 
variables  X,  the  function  is  deterministic.  Therefore,  the  design  sensitivity  of  the  cost  function  with  respect  to  the 
design  variable  is  calculated  by  FDM,  and  the  four  design  variables  (and  related  random  variables  XN4,  XNg,  XN m, 
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and  XNn)  that  show  the  largest  sensitivity  among  the  unselected  design  variables  are  chosen.  Then,  RBDO  is 
carried  out  with  three  sets  of  selected  random  variables.  The  optimum  design  results  are  summarized  in  Table  18. 
The  bold  font  indicates  chosen  design  variables,  and  others  are  fixed  at  the  baseline  design  value.  Also,  probabilities 
of  failure,  cost  function  values,  and  design  iteration  details  are  listed  in  Table  19.  All  RBDOs  are  carried  out  using 
I-RBDO  code  with  500,000  MCS  samples  [10].  For  the  three  cases,  I-RBDO  generates  DKG  surrogate  models, 
using  the  sample  responses  obtained  from  the  44-D  global  surrogate  models,  which  are  treated  as  true  responses,  for 
RBDO.  Since  I-RBDO  is  able  to  carry  out  RBDO  using  the  surrogate  models  generated  by  other  methods  (i.e.,  the 
surrogate  models  generated  by  Ford  in  this  example)  without  having  information  on  how  the  surrogate  models  are 
generated,  the  full-dimensional  RBDO  is  performed  as  well  using  the  44-D  surrogate  models.  The  44-D  RBDO 
result  is  treated  as  the  true  RBDO  optimum  and  used  for  the  purpose  of  validation  of  RBDO  results  obtained  for  the 
three  reduced-dimensional  cases. 

Indeed,  the  optimum  design  values  for  dt~ds  are  very  close  to  the  full-dimensional  44-D  case,  as  shown  in 
Table  18,  which  shows  that  the  DKG  surrogate  models  generated  in  the  I-RBDO  code  are  accurate.  In  the  three 
cases,  the  random  variables  X\~X%  are  selected  because  they  have  large  partial  output  variance  for  performance 
measures  Gi~G9.  This  means  that  they  contribute  a  large  portion  of  the  output  variance.  Hence,  finding  optimum 
values  for  them  is  the  most  effective  way  to  reduce  the  probabilities  of  failure  of  Gi~G9.  Similarly,  d2 2 
(corresponding  to  X26)  moves  to  the  upper  bound  of  1.1  when  it  is  selected  because  it  has  the  largest  partial  output 
variances  of  G10  and  Gn.  The  design  variables  d27,  d32,  t/33i  and  <f34  (corresponding  to  XN4,  XN9,  XNU)  and  XNn), 
which  are  selected  by  the  sensitivity  of  the  cost  function,  move  to  their  lower  bounds  of  0.7,  0.6,  0.6,  and  0.9, 
respectively,  to  reduce  the  cost  function  without  significantly  affecting  the  reliability  of  the  optimum  design.  On  the 
other  hand,  even  though  some  design  variables  are  selected  due  to  the  partial  output  variances  of  some  constraints, 
they  move  to  their  lower  bounds.  For  example,  d2 4  (corresponding  to  XNi)  moves  to  the  lower  bound  of  0.7  because 
it  has  the  largest  sensitivity  for  the  cost  function,  even  though  it  has  the  third-largest  partial  output  variance  of  Gn. 
That  is,  via  the  trade-offs  in  the  optimization  process,  it  is  moved  to  the  lower  bound  to  minimize  the  cost  function 
rather  than  to  reduce  the  probability  of  failure. 

In  Table  19,  the  number  of  black  box  calls  is  listed.  I-RBDO  can  deal  with  multiple  samples  simultaneously  to 
create  a  surrogate  model  to  utilize  parallel  computing.  That  is,  one  black-box  call  requests  computational 
simulations  at  a  number  of  sampling  points;  thus,  the  black  box  call  indicates  the  clock  time  required  for  analyses 
for  RBDO.  Therefore,  the  number  of  black  box  calls  represents  actual  computational  cost  more  realistically  than  the 
number  of  CAEs  when  parallel  computing  is  used.  In  this  example,  five  samples  are  added  to  the  DKG  model  at  a 
time;  therefore,  the  total  number  of  CAEs  is  roughly  five  times  the  number  of  black  box  calls. 

To  verify  once  again  that  the  surrogate  model  generated  by  DKG  using  I-RBDO  is  accurate,  the  same  three 
cases  are  performed  using  I-RBDO  by  using  responses  from  44-D  global  surrogate  model  directly  while  fixing 
screened-out  variables  at  their  baseline  design  points.  As  shown  in  Table  20,  the  optimums  found  using  DKG  and 
44-D  global  surrogate  models  are  very  close  to  each  other.  Hence,  it  is  confirmed  that  DKG  in  I-RBDO  generated 
an  accurate  surrogate  model.  Moreover,  it  is  also  verified  that  RBDO  can  be  conducted  based  on  an  accurate 
surrogate  model  even  for  a  moderately  large-dimensional  problem  (14  and  18  dimensions). 
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For  the  three  reduced-dimensional  cases,  the  probabilities  of  failure  are  calculated  using  only  selected  variables 
as  random  variables,  since  the  other  design  variables  are  treated  as  deterministic  as  explained  in  Section  2.2  with 
fixed  values  at  the  baseline  design.  As  shown  in  Table  19,  it  is  noted  that,  in  all  cases,  the  target  design  constraints 
of  10%  probability  failure  are  closely  satisfied  as  expected  at  these  optimum  designs  since  the  RBDO  considers  only 
the  selected  variables  as  random  variables.  On  the  other  hand,  to  check  correct  reliabilities,  reliability  analyses  are 
carried  out  at  these  optimum  designs  treating  all  variables  as  random  for  the  44-D  surrogate  models  and  MCS  with  1 
million  samples,  as  shown  in  Table  21.  It  is  noted  that  probabilities  of  failure  of  full-dimensional  optimum  in  Table 
19  and  Table  21  are  different  even  though  the  discrepancy  is  negligible.  Theoretically,  they  should  be  the  same; 
however,  they  are  not  equal  because  different  numbers  of  MCS  samples  (500,000  and  1  million)  are  used  and  MCS 
error  is  induced.  At  the  baseline  design,  all  constraints  have  approximately  50%  of  probability  of  failure,  and  this  is 
reasonable  because  all  constraints  are  active  at  the  baseline  design.  However,  they  are  not  exactly  50%  because  the 
constraint  functions  are  nonlinear.  Probabilistic  constraint  results  corresponding  to  G\~Gg  are  active  or  feasible  in 
both  Table  19  and  Table  21.  Due  to  the  fact  that  G\  to  Gg  are  functions  of  X\  to  Xg,  and  all  of  them  are  selected  as 
important  variables,  the  RBDO  result  with  reduced-dimension  and  the  reliability  analysis  result  with  full-dimension 
are  very  close  to  each  other,  considering  MCS  errors.  The  constraint  G 1 0  shows  inactive  results  regardless  of  which 
selection  design  variable  set  is  used. 

All  probabilities  of  failure  for  the  constraint  Gn  in  Table  19  satisfy  the  target  probability  of  failure  10%,  which 
makes  it  obvious  that  these  are  the  reliability  analyses  results  of  reduced-dimensional  problems.  However,  full¬ 
dimensional  reliability  analyses  at  optimum  designs  show  quite  different  values  as  shown  in  Table  21.  Selection 
based  on  experience  shows  a  17.70%  probability  of  failure,  which  violates  the  target  reliability  significantly.  The 
variables  selected  based  on  experience  contain  only  55.40%  of  the  total  output  variance  of  Gn.  Hence,  it  cannot 
find  any  safe  design  once  dimension  is  reduced.  On  the  other  hand,  the  probabilities  of  failure  for  the  proposed 
variable  screening  method  and  also  the  one  considering  cost  function  are  close  to  the  target  probability  of  failure. 
The  selected  variables  contain  93.55%  and  93.88%  of  the  total  output  variance  of  Gn  for  Case  2  and  Case  3, 
respectively.  Hence  it  can  find  a  correct  optimum  even  with  reduced  dimension. 


Table  18.  RBDO  Optimum  Design 


Design 

Variables 

Corresponding 

RVs 

Baseline 

Design 

Based  on 
Experience 

Variable 

Screening 

Variable  Screening 
+  Cost  Function 

Full 

Dimension 

d  i 

1.9 

1.8343 

1.8366 

1.8336 

1.8491 

di 

*2 

1.91 

2.1810 

2.1804 

2.1806 

2.1692 

dg 

*3 

2.51 

2.8528 

2.8561 

2.8540 

2.8803 

dg 

x4 

2.4 

1.9817 

1.9810 

1.9856 

1.9507 

ds 

*5 

2.55 

2.7195 

2.7228 

2.7261 

2.7296 

d6 

*6 

2.25 

2.2543 

2.2497 

2.2558 

2.2408 

dj 

*7 

2.25 

2.3199 

2.3185 

2.3207 

2.3284 

dg 

X% 

1.5 

1.7904 

1.7966 

1.7860 

1.8 

dg 

Xn 

1.28 

1.28 

0.9 

0.9 

1.5336 

dio 

Xu 

1.4 

1.4 

1.4 

1.4 

1.0038 

du 

X\2 

1.1 

1.1 

1.1 

1.1 

0.9868 

d\2 

Xn 

2.2 

2.2 

2.2 

2.2 

1.7006 

dn 

Xu 

1.5 

1.5 

1.5 

1.5 

1.2 

d  14 

Xu 

1.25 

1.25 

1.25 

1.25 

0.9 

dis 

Xk, 

2.5 

2.5 

2.5 

2.5 

2.0 
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die 

*17 

2.0 

2.0 

2.0 

2.0 

1.5033 

dn 

Xl8 

1.4 

1.4 

1.4 

1.4 

1.1 

dn 

xw 

1.22 

0.9 

0.9 

0.9 

0.9 

dig 

*23 

0.75 

0.75 

0.6 

0.6 

0.6 

dig 

*24 

1.9 

1.9 

1.9 

1.9 

1.5 

d2i 

*25 

0.65 

0.6897 

0.5429 

0.5871 

0.8 

d22 

*26 

0.85 

0.85 

1.1 

1.1 

1.1 

d22> 

*27 

0.85 

0.85 

0.85 

0.85 

1.1 

d2  4 

XNi 

0.9 

0.7 

0.7 

0.7 

0.7 

d25 

xn2 

1.1 

1.1 

1.1 

1.1 

0.8 

d26 

xn3 

1.55 

1.55 

1.55 

1.55 

1.2039 

d2i 

xn4 

0.9 

0.9 

0.9 

0.7 

0.7 

d28 

xn5 

1.5 

1.5 

1.5 

1.5 

1.2 

d2  9 

xn6 

1.2 

1.2 

1.2 

1.2 

0.9 

d3o 

XNj 

1.1 

1.1 

1.1 

1.1 

0.8 

dn 

XN g 

1.52 

1.52 

1.52 

1.52 

1.6925 

d22 

XNg 

0.8 

0.6 

0.8 

0.6 

0.6 

dn 

**10 

0.8 

0.6 

0.8 

0.6 

0.6 

d24 

**11 

1.2 

0.9 

1.2 

0.9 

0.9 

d25 

XNn 

0.75 

0.75 

0.75 

0.75 

0.6 

d26 

XNU 

0.75 

0.75 

0.75 

0.75 

0.6 

d2i 

**14 

0.75 

0.75 

0.75 

0.75 

0.6 

d28 

**15 

1.0 

1.0 

1.0 

1.0 

0.8 

d2g 

**16 

1.14 

1.14 

1.14 

1.14 

0.9083 

dgo 

XNn 

1.2 

1.2 

1.2 

1.2 

0.9 

dgi 

**18 

1.4 

1.4 

1.4 

1.4 

1.1 

dni 

XNig 

1.2 

1.2 

1.2 

1.2 

0.9 

dn 

**20 

1.4 

1.4 

1.4 

1.4 

1.1 

dg4 

XN2l 

2.13 

2.13 

2.13 

2.13 

1.7 
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Table  19.  Cost,  Probabilities  of  Failure  at  RBDO  Optimum  Design,  and  Optimization  Details 


Perfonnance 

Based  on 

Variable 

Variable  Screening 

Full 

Measure 

Experience 

Screening 

+  Cost  Function 

Dimension 

Cost 

259.83 

244.17 

225.68 

G, 

10.06% 

9.94% 

9.96% 

9.96% 

g2 

10.03% 

10.11% 

10.04% 

9.99% 

g3 

g4 

g5 

1.96% 

1.95% 

1.91% 

g6 

9.95% 

10.01% 

10.02% 

9.99% 

g7 

9.89% 

9.93% 

9.95% 

10.01% 

g8 

10.03% 

9.98% 

9.91% 

g9 

10.04% 

9.99% 

9.87% 

9.92% 

Gio 

G„ 

9.97% 

9.93% 

No.  of  Design 
Iter. 

20 

30 

20 

21 

No.  of  CAEs 

2,666 

4,358 

3,306 

- 

No.  of  Black 
Box  Calls 

473 

727 

573 

- 

Table  20.  RBDO  Optimum  Design  with  1-RBDO  and  True  Model 


Design 

Variables 

Corresponding 

RVs 

Based  on  Experience 

Variable  Screening 

Var.  Screen.  +  Cost  Fn. 

I-RBDO 

44-D  Surrogate 

I-RBDO 

44-D  Surrogate 

I-RBDO 

44-D  Surrogate 

d\ 

A, 

1.8343 

1.8371 

1.8366 

1.8359 

1.8336 

1.8425 

d 2 

A2 

2.1810 

2.1799 

2.1804 

2.1807 

2.1806 

2.1771 

di 

A3 

2.8528 

2.8578 

2.8561 

2.8576 

2.8540 

2.8654 

d4 

X, 

1.9817 

1.9807 

1.9810 

1.9851 

1.9856 

1.9525 

ds 

As 

2.7195 

2.7222 

2.7228 

2.7233 

2.7261 

2.7209 

d(, 

A6 

2.2543 

2.2488 

2.2497 

2.2501 

2.2558 

2.2464 

d-j 

a7 

2.3199 

2.3173 

2.3185 

2.3169 

2.3207 

2.3265 

dg 

a8 

1.7904 

1.8 

1.7966 

1.7985 

1.786 

1.8 

dg 

Aio 

- 

- 

0.9 

0.9 

0.9 

0.9 

d  18 

A>o 

0.9 

0.9 

0.9 

0.9 

0.9 

0.9 

d  19 

a23 

- 

- 

0.6 

0.6 

0.6 

0.6 

di\ 

A25 

0.6897 

0.6875 

0.5429 

0.5424 

0.5871 

0.5826 

dii 

Xl6 

- 

- 

1.1 

1.1 

1.1 

1.1 

dn 

XNi 

0.7 

0.7 

0.7 

0.7 

0.7 

0.7 

dn 

xn4 

- 

- 

- 

- 

0.7 

0.7 

dgi 

XNg 

0.6 

0.6 

- 

- 

0.6 

0.6 

dn 

AV10 

0.6 

0.6 

- 

- 

0.6 

0.6 

d}4 

XNU 

0.9 

0.9 

- 

- 

0.9 

0.9 
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Table  21.  Reliability  Analysis  Result  Using  Full-Dimensional  Surrogate  Model 


Perfomiance 

Measure 

Baseline 

Design 

Based  on 
Experience 

Variable 

Screening 

Variable  Screening 
+  Cost  Function 

Full 

Dimension 

Cost 

269.47 

249.10 

259.83 

244.17 

225.68 

Gi 

48.25% 

10.06% 

9.96% 

10.00% 

10.05% 

g2 

51.34% 

10.02% 

10.11% 

10.04% 

10.09% 

g3 

54.14% 

0.00% 

0.00% 

0.00% 

0.00% 

g4 

55.57% 

0.10% 

0.12% 

0.09% 

0.12% 

g5 

58.94% 

1.96% 

1.93% 

1.98% 

1.91% 

g6 

59.70% 

10.08% 

10.05% 

10.05% 

10.00% 

g7 

59.86% 

10.20% 

10.04% 

9.91% 

10.06% 

Gg 

53.23% 

10.02% 

10.03% 

9.97% 

9.14% 

g9 

51.15% 

10.02% 

9.96% 

9.96% 

9.96% 

G  io 

49.10% 

0.00% 

0.00% 

0.00% 

0.00% 

Gn 

52.46% 

17.70% 

11.23% 

11.17% 

10.05% 

5.  Conclusion 

A  new  efficient  and  effective  variable  screening  method  for  RBDO  is  proposed  in  this  paper.  For  the  proposed 
methods,  the  output  variance  is  used  as  a  measure  that  can  identify  important  design  variables.  Thus,  a  partial 
output  variance  based  on  the  univariate  DRM  is  proposed  to  approximate  the  output  variance  efficiently  and  to 
identify  the  design  variables  that  affect  output  variance  more  significantly  than  others.  The  univariate  DRM  and 
partial  output  variance  only  require  multiple  1-D  surrogate  models,  which  is  much  more  efficient  than  the  full¬ 
dimensional  surrogate  models.  Flence,  the  proposed  method  has  great  merit  in  efficiency  as  well  as  effectiveness. 
To  reduce  computational  time  and  maintain  a  user-specified  statistical  error  level,  hypothesis  testing  is  used  in  the 
variable  screening  process.  Also,  a  required  minimum  number  of  samples  for  calculating  the  correct  output  variance 
is  proposed  using  the  user-specified  error  level.  In  addition,  the  quadratic  interpolation  method  is  tailored  to  be 
applied  to  efficient  partial  output  variance  calculation. 

Two  analytical  examples  and  a  44-D  industrial  example  are  used  to  verify  the  performance  of  the  proposed 
variable  screening  method.  Through  the  analytical  examples,  it  is  shown  that  at  least  quadratic  approximation  is 
required  for  the  1-D  surrogate  model  and  that  partial  output  variance  is  a  good  measure  that  successfully  identifies 
important  variables.  In  the  industrial  example,  14  design  variables  out  of  44  are  selected  by  considering  the  output 
variances  of  11  constraints.  For  comparison,  another  14  design  variables  selected  based  on  experience  are  used.  In 
addition,  18  design  variables  are  selected  by  adding  four  design  variables,  which  affect  the  objective  function 
significantly  while  not  affecting  the  output  variances  much,  to  the  14  design  variables  previously  selected  with  the 
proposed  method.  The  selection  based  on  experience  shows  a  7.6%  reduced  cost  value,  whereas  the  target 
probability  of  failure  is  violated  by  77%.  However,  selection  by  the  proposed  method  shows  only  a  12.3% 
disagreement  of  target  value  and  a  3.6%  reduced  cost  value.  Moreover,  the  selection  of  18  design  variables  shows 
11.7%  target  disagreement  as  well  as  9.4%  reduced  cost  value.  Therefore,  the  performance  of  the  proposed  variable 
screening  method  is  verified. 
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